AITopics | egocentric dataset

Collaborating Authors

egocentric dataset

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Learning State-Aware Visual Representations from Audible Interactions

Neural Information Processing SystemsDec-24-2025, 20:26:19 GMT

We propose a self-supervised algorithm to learn representations from egocentric video data. Recently, significant efforts have been made to capture humans interacting with their own environments as they go about their daily activities. In result, several large egocentric datasets of interaction-rich multi-modal data have emerged. However, learning representations from videos can be challenging. First, given the uncurated nature of long-form continuous videos, learning effective representations require focusing on moments in time when interactions take place.

audible interaction, learning state-aware visual representation, name change, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Vision (0.77)

Add feedback

OpenEgo: A Large-Scale Multimodal Egocentric Dataset for Dexterous Manipulation

Jawaid, Ahad, Xiang, Yu

arXiv.org Artificial IntelligenceSep-9-2025

Egocentric human videos provide scalable demonstrations for imitation learning, but existing corpora often lack either fine-grained, temporally localized action descriptions or dexterous hand annotations. We introduce OpenEgo, a multimodal egocentric manipulation dataset with standardized hand-pose annotations and intention-aligned action primitives. OpenEgo totals 1107 hours across six public datasets, covering 290 manipulation tasks in 600+ environments. We unify hand-pose layouts and provide descriptive, timestamped action primitives. To validate its utility, we train language-conditioned imitation-learning policies to predict dexterous hand trajectories. OpenEgo is designed to lower the barrier to learning dexterous manipulation from egocentric video and to support reproducible research in vision-language-action learning. All resources and instructions will be released at www.openegocentric.com.

artificial intelligence, dataset, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2509.05513

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Robots > Manipulation (0.61)

Add feedback

Learning State-Aware Visual Representations from Audible Interactions

Neural Information Processing SystemsJan-18-2025, 00:25:33 GMT

audible interaction, egocentric dataset, learning state-aware visual representation, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Vision (0.82)

Add feedback